# Robust speech processing

Wav2vec2 Large Robust 6 Ft Age Gender Finetuned Gtzan

An audio classification model based on the wav2vec2 architecture, fine-tuned on the privateSLI dataset for age and gender recognition.

Audio Classification

languageresearch

Wav2vec2 Xls R 300m Indonesian

An automatic speech recognition model fine-tuned from Facebook's XLS-R-300M model on Indonesian speech data.

Speech Recognition

Transformers Other

Wav2vec2 Xls R 1b Korean

A Korean automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-1b on the kresnik/zeroth_korean (clean) dataset.

Speech Recognition

Transformers Korean

Xls R 300 Sv Cv7

An automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Swedish Common Voice 7.0 dataset.

Speech Recognition

Transformers Other

patrickvonplaten

Wav2vec2 Large Xlsr 53 Demo Colab

A speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the common_voice dataset, primarily used for robust speech event recognition.

Speech Recognition

Wav2vec2 Large Xls R 300m Romansh Sursilvan

An automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on a dataset of the Romansh Sursilvan dialect.

Speech Recognition

Wav2vec2 Large Xls R 300m Hindi

A Hindi speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on Hindi datasets for speech-to-text tasks.

Speech Recognition

Transformers Other

Wav2vec2 Large Xls R 300m Latvian

An automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on Latvian datasets, achieving a word error rate (WER) of 16.98% on the Common Voice 7 test set.

Speech Recognition

Transformers Other
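Several entries in this list report a word error rate (WER). As a rough illustration of what that figure measures, the sketch below computes WER as the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` and the whitespace tokenization are assumptions made for this example; the listed models may have been scored with different text normalization or tooling.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = word-level edit distance / number of reference words.

    Hypothetical helper for illustration; splits on whitespace only and
    applies no casing or punctuation normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution (sat -> sit) and one deletion (the) over 6 reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Under this definition, a WER of 16.98% corresponds to roughly 17 word-level errors per 100 reference words. Because insertions count as errors, WER can exceed 100% when the hypothesis is much longer than the reference.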

Wav2vec2 Large Xlsr Coraa Portuguese Cv7

A Portuguese speech recognition model fine-tuned from Edresson/wav2vec2-large-xlsr-coraa-portuguese on the Common Voice dataset.

Speech Recognition

Transformers Other

Wav2vec2 Xls R 300m Turkish Tr Small

A Turkish speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice dataset.

Speech Recognition

Wav2vec2 Indonesian Javanese Sundanese

This is a multilingual speech recognition model supporting Indonesian, Javanese, and Sundanese, fine-tuned from facebook/wav2vec2-large-xlsr-53.

Speech Recognition

Transformers Other

Wav2vec2 Xls R Pt Cv7 From Bp400h

This is a Portuguese automatic speech recognition (ASR) model based on the wav2vec2 XLS-R architecture, fine-tuned on the Common Voice 7 dataset, achieving a word error rate (WER) of 12.13% on the test set.

Speech Recognition

Transformers Other

Wav2vec2 Large Xls R 1b Indonesian

An automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-1b on the Common Voice Indonesian dataset.

Speech Recognition

Transformers Other

A Telugu automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-xls-r-2b on the OpenSLR SLR66 dataset.

Speech Recognition

Transformers Other

An automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice 8 Dhivehi dataset.

Speech Recognition

Transformers Other

Wav2vec2 Large Xls R 300m Sl With LM V1

An automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-xls-r-300m on the Slovenian Common Voice 8.0 dataset, with language model (LM) integration to improve recognition performance.

Speech Recognition

Transformers Other

Wav2vec2 Large Xls R 300m Hi Cv8

An automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-xls-r-300m on the Hindi Common Voice 8 dataset.

Speech Recognition

Transformers Other

Wav2vec2 Large Xls R 300m Cv8 Nl

An automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice 8 Dutch dataset, including a 6-gram KenLM language model.

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr 53 Demo Colab

This is an automatic speech recognition model based on the wav2vec2 architecture, specifically optimized for the Tamil language and supporting Nepali speech recognition tasks.

Speech Recognition

Transformers Other

Xls R Nl V1 Cv8 Lm

This is an automatic speech recognition model based on the XLS-R architecture, specifically optimized for Dutch and Flemish, incorporating a 5-gram language model to improve recognition accuracy.

Speech Recognition

Transformers Other

An automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on a Galician dataset, achieving a WER of 11.31% on the Common Voice 8.0 test set.

Speech Recognition

Transformers Other

Wav2vec2 Large Xls R 300m Sat Final

An automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Santali (sat) subset of the mozilla-foundation/common_voice_8_0 dataset, supporting the Santali (Ol Chiki) language.

Speech Recognition

Transformers Other

Wav2vec2 Large Xls R 300m Br D2

A speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on Breton (Common Voice 8.0).

Speech Recognition

Transformers Other

Wav2vec2 Large Xls R 1b Cv8 Mt

An automatic speech recognition model fine-tuned from facebook/wav2vec2-xls-r-1b on the Common Voice 8 Maltese dataset.

Speech Recognition

Transformers Other

© 2025 AIbase